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COMPOSITIONS AND METHOD S FOR MODIFYING THE REGULATORY 

ACTIVITY OF TGF-fl 



FTELD OF THE INVENTION 

The present invention relates to cell biology and 
5 to methods of modifying the biological activity of cell 
regulatory factors. More specifically, the pres nt 
invention relates to a novel TGF-A-binding glycoprotein. 

Throughout this application various publications 
are referenced within parentheses. The disclosures of 
10 these publications in their entireties are hereby 
incorporated by reference to more fully describe the stat 
of the art to which this invention pertains. 

BACKGROTTWn OF TF F. TNVENTION 

Glycoproteins, in which one or more carbohydrate 
15 units have been attached covalently to the protein by 
posttranslational processing are widely distributed. 
Several secretory proteins, including the immunoglobulins, 
are glycoproteins, as are most components of plasma 
membranes such as cell membrane receptors, where the 
20 carbohydrates can be involved in cell-to-cell adhesion. 

Transforming growth factor fl (TGF-fl) refers to a 
family of multi-functional cell regulatory factors produc d 
in various forms by many cell types (for review see Sporn 
et al,. J - Cell Biol. 105:1039 (1987)). Five distinct 

25 isoforms of TGF-B have been identified. TGF-fl 1 and TGF-B2 
have been characterized in detail. TGF-fl is the subject of 
U.S. Patent Nos. 4,863,899; 4,816,561 and 4,742,003 which 
are incorporated herein by reference. TGF-fl binds to cell 
surface receptors present on various types of cells and is 

30 known to potentiate or inhibit the response of most cells 
to other growth factors, depending on the cell type. TGF-fl 
also regulates differentiation of some cell types, either 
promoting or inhibiting proliferation of the cell. Another 
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marked effect of TGF-fl is the promotion of cellular 
production of extracellular matrix proteins and their 
receptors (for a review see Keski-Oja et al., J. — Cell 
Biochem. 33:95 (1987); Massague, Cell 49:437 (1987); 
5 Roberts and Sporn, "Peptides Growth Factors and Their 
Receptors Springer-Verlag (1989)). 

Notwithstanding the beneficial and essential cell 
regulatory functions served, TGF-fl regulatory activity can 
prove detrimental to its host organism. For example, 

10 whereas growth and proliferation of mesenchymal cells is 
stimulated by TGF-fl, some tumor cells may also b 
stimulated, using TGF-fl as an autocrine growth factor. In 
other cases the inhibition of cell proliferation by TGF-fl 
similarly proves detrimental to its host organism. An 

15 example would be the prevention of new cell growth to 
assist in repair of tissue damage. The stimulation of 
extracellular matrix production by TGF-fl is essential for 
wound healing. However, in some cases, the TGF-fl response 
is uncontrolled and an excessive accumulation of 

20 extracellular matrix results. An example of excessive 
accumulation of extracellular matrix is the n internal ■ 
scarring that occurs in the pathology glomerulonephritis 
and dermal scar tissue formation. 

25 The transforming growth factor-/3 receptor system 

in most mesenchymal and epithelial cells consists of 
several components (Massague, J. Ann, Rev. Cell Biol. 6:597 
(1990); Lin, H.Y. et al., Cell 68:775 (1992); Georgi, L.L. 
et al., Cell 61:635 (1990); Mathews, L.S. et al., Cell 

30 65:973 (1991); Attisano, L. et al., Cell 68::97 (1992); 
Lopez-Casillas et al., Cell 67:785 (1991) and Wang et al., 
Cell 67:796 (1991)), one of which is betaglycan, a 
membrane-anchored proteoglycan. In addition to betaglycan, 
the TGF-/3 receptor system in most mesenchymal and 

35 epithelial cells consists of the type I r ceptor, a 53-kDa 
glycoprotein whose structure has not been d termin d yet, 
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and the type II receptor, which belongs to the protein 
serine/threonine kinase receptor family. Additional cell 
surface TGF-/3-binding proteins, some of which have a more 
restricted distribution, have also been described. 

5 Thus, a need exists to develop compounds that can 

modify the effects of cell regulatory factors such as TGF- 
fl. The present invention satisfies this need and provides 
related advantages as well. 

SUMMARY OF THE INVENTION 

10 The present invention provides a novel purified 

TGF-A-binding glycoprotein. The protein, endoglin, is 
expressed at high levels on human vascular endothelial 
cells. 

Further provided by the present invention are 
15 methods of treating pathologic conditions mediated by TGF-fl 
regulatory activity by contacting the TGF-fl with an 
effective amount of purified endoglin-derived polypeptid 
or any fragment thereof having the ability to bind TGF-fl. 
Thus, intact, native endoglin and soluble fragments thereof 
20 are useful in these methods. This invention provides a 
method of preparing and purifying full length and soluble 
endoglin-derived polypeptide. Isolated nucleic acids 
encoding the novel TGF-B-binding glycoprotein and soluble 
endoglin-derived polypeptides are also provided, as well as 
25 vectors containing the nucleic acids and recombinant host 
cells transformed with such vectors. 

BRTBF DESCRIPTION OF TH E FIGURES 

Figure 1 shows the domain structures of 
betaglycan and endoglin. Shown is a schematic 

30 representation highlighting regions of similarity between 
the linear sequences of betaglycan, an 853-amino acid 
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transmembrane proteoglycan, and endoglin, a disulfide- 
linked transmembrane protein composed of two identical 
subunits of 633 amino acids each. The transmembrane and 
short cytoplasmic regions (dark shaded box) of endoglin 
5 have a high level of sequence similarity to th 
corresponding regions of betaglycan. Two regions of weaker 
similarity are detected in the ectodomains of theee 
proteins (light shaded boxes). Numbers represent the 
percent amino acid sequence similarity between the 
10 indicated domains of betaglycan and endoglin. Closed ovals 
represent positions of cysteine residues. Two putative 
sites for glycosaminoglycan chain attachment in betaglycan 
are indicated. 

Figure 2 shows cell surface TGF-£1 -binding 

15 proteins expressed by HUVEC. Near confluent cultures of 
HUVECs were affinity-labeled by incubation with 100 pM ia5 I- 
TGF-/31 followed by chemical cross-linking with 0.16 mM 
disuccinimidyl suberate. A) Triton X-100 extracts of 
affinity-labeled HUVEC were resolved on SDS-PAGE gels under 

20 reducing (R) or nonreducing (NR) conditions. Lane C 
contains extract from cells affinity- labeled in the 
presence of excess unlabeled TGF-pl. The migration 
position of TGF-/3 receptors I (RI) and II (RII) are 
indicated. Arrow, the major affinity- labeled proteins of 

25 180 kDa and higher molecular mass apparent on nonreducing 
gels. Arrowhead, the affinity-labeled proteins of 110-120 
kDa seen on reducing gels. B) Detergent extracts of 
affinity-labeled HUVEC were resolved under nonreducing 
conditions on a first gel that was then resolved under 

30 reducing conditions in the second dimension as described in 
Cheifetz and Massague, .t. Biol. Chem. 266:20767-20772 
(1991), incorporated herein by reference. The 110-120-kDa 
labeled species migrating of f-the-diagonal are indicated 
( arrowheads ) . 
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Figure 3 shows specific immune-precipitation of 
TGF-/31-endoglin complexes. HUVECs were affinity-labeled 
with 100 pM "'I-TGF-/81 as described in Figure 2. A) 
Detergent extracts of affinity-labeled cells were incubat d 

5 with mAb 44G4 and immune complexes were collected on 
protein G-Sepharose. After washes, equal aliquots of the 
samples were analyzed under reducing (R) or non-reducing 
(NR) conditions by SDS-PAGE (5-8% polyacrylamide gradient 
gels). B) Affinity-labeled HUVEC lysates were maximally 

10 depleted of endoglin by two successive 45 min incubations 
at 4°C with 100 pi of 44G4-IgG-Sepharose. S) supernatant 
after second immunoprecipitation. I) the first 44G4 
inununoprecipitation which contained 83% of the endoglin. 
T) corresponding amount of total extract used for the 

15 depletion experiment. All samples were analyzed under 
nonreducing conditions on SDS-PAGE with the exception of I B , 
which was run under reducing conditions. The migration 
positions of TGF-/3 receptor II (MI), and endoglin monomer, 
dimer, and oligomer are indicated. 

Figure 4 shows that endoglin transiently 
expressed in COS-M6 cells binds TGF-/31. COS-M6 cells were 
transfected with a cDNA encoding full-length L-endoglin 
(Endoglin) or control vector (C) . Cells were affinity- 
labeled with 150 P M » 5 I-TGF-/31 and the detergent extracts 
25 incubated with mAb 44G4 followed by protein G-Sepharose. 
immunoprecipitated proteins were analyzed by SDS-PAGE under 
reducing (R) and non-reducing (NR) conditions and 
visualized by autoradiography. 

Figure 5 shows the specificity of endoglin for 
30 TGF-/3 isoforms assessed in COS cell transf ectants and in 
HUVEC. A) COS-M6 cells transf ected with endoglin vector 
were affinity-labeled with 150 pM *"l-TGF-/il alone or in the 
presence of 1 or 10 nM unlabeled TGF-/J1, -/32 or -03. B) 
BUVEC were affinity-labeled with 100 pM » 5 I-TGF-£1 alone r 
35 in the presence of 5 nM unlabeled TGF-/31 or TGF-/52. 
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Lysates from these cells were immunoprecipitated with MAb 
44G4. Immunoprecipitates were fractionated under reducing 
conditions on SDS-PAGE gels. The region of the gels 
containing monomeric endoglin is shown along with the 
5 migration position of 100-kDa marker. 

Figure 6 shows the restriction enzyme map of 
vector pcNeoSolEND with the 1.7 kb endoglin cDNA insert. 

Figure 7 shows the partial nucleotide and 
predicted amino acid sequences of isolated S-endoglin cDNA. 

10 Nucleotides are numbered on the right. Amino acids ar 
numbered on the left. The predicted signal sequence 
(nucleotides 283-357) and transmembrane region (nucleotid 8 
2042-2116) are boldfaced. The 135 bp insert (underlined) 
contains splicing consensus sequences of donor/acceptor 

15 sites (GT, AG at positions 2134 and 2267) and branch point 
of lariat (CTGAC at position 2234). Nucleotides 372-2021 
and 2322-3073 were found to be identical to the 
corresponding cDNA sequence of endothelial endoglin 
(Gougos, A. and Letarte, M.J., J. Biol. Chem. 265:8361 

20 (1990) and Table 1). 

Figure 8 shows the analysis of the cytoplasmic 
region which reveals the existence of two different forms 
of endoglin. A. Diagram of the cDNA coding for the 
cytoplasmic regions of the two alternative forms of 

25 endoglin. Only the region corresponding to the 3' end 
containing the cytoplasmic domain is depicted. The 
isolated endoglin cDNA (S-endoglin) contains a 135 bp 
insert not present in the previously described sequence (L- 
endoglin) (Gougos, A. and Letarte, M.J., J. Biol. Chem. 

30 265:8361 (1990)). The sequence of the additional 135 bp 
insert is shown in Figure 7. The position of the stop 
codons and the corresponding translated protein sequences 
(thick- bar), are indicated. B. Alignment of the 
cytoplasmic and transmembrane domains of S-endoglin with 
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the corresponding sequ nces of L-endoglin and human 
betaglycan, Moren, A. et al., Biochem. Biophys. Res. 
Commun. 189:356 (1992). Boxes contain identical sequences. 
Numbers indicate the position of the first amino acid in 
5 the whole sequence. Two major regions of identity were 
found. The first region (73% identity) involves residues 
587-617 of S- and L-endoglin and residues 780-810 of 
betaglycan. The second region (74% identity), involves 
residues 634-660 of L-endoglin and residues 823-849 of 
10 betaglycan. The transmembrane region of S-endoglin is 
boldfaced. Dashes have been inserted for purposes of 
alignment. Asterisks indicate the last residue of the 
protein. 

Figure 9 shows the expression of L-endoglin and 

15 S-endoglin in transfectant cells. Mouse fibroblasts were 
transfected with either L-endoglin or S-endoglin cDNA and 
the expression of the endoglin molecule analyzed. A. 
Analysis by cytof luorometry of the endoglin present at the 
cell surface. After trypsinization, cells were stained for 

20 indirect immunofluorescence with the monoclonal antibody 
8E11 (anti-endoglin) . A control staining of endoglin" mock 
transfectants is also shown. B. Immunoprecipitation 
analysis. Cells were metabolically labeled with 

[ 35 S]methionine, lysed and immunoprecipitated with 44G4 

25 (anti-endoglin) or HC1/1 (anti CDllc) monoclonal antibody. 
Monoclonal antibody HCl/1 was included as a negative 
control. Samples were electrophoresed on a 6-12% 
acrylamide gradient gel under nonreducing conditions. C. 
Immunoblotting analysis. Mock (L cells), L-endoglin (L- 

30 Endo) and S-endoglin (S-Endo) cDNA transfected mouse 
fibroblasts and PMA- treated 0937 cells were lysed in Triton 
X-100, and the insoluble material removed by 
centrifugation. Proteins contained in the supernatant were 
electrophoresed on a 6% acrylamide gel under non-reducing 

35 conditions and transferred to nitrocellulose membranes. 
Immunodetection of endoglin was carried out with 44G4 
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(anti-endoglin) monoclonal antibody using a 
chemiluminiscent assay. HC1/1 (anti-CDllc) and X63 
monoclonal antibody were used as negative controls. D. 
Immunoprecipitation analysis of cell surface labeled 
5 endoglin. Mock, L-endoglin (L-Endo) and S-endoglin (S- 
Endo) transfected mouse fibroblasts were " 5 I-labeled, lysed 
and immunoprecipitated with 44G4 monoclonal antibody (anti- 
endoglin). Samples were electrophoresed on a 10% 
acrylamide gel under either reducing (R) or nonreducing 
10 conditions (NR) . 

Figure 10 shows the detection of L-endoglin and 
S-endoglin transcripts by PCR amplification. Samples of 
total RNA from placenta, PMA treated HL-60 cells or PMA- 
treated U937 cells were incubated either in the presence 

15 (+RT) or in the absence (-RT) of reverse transcriptase. 
The generated cDNA samples, together with cDNA from S- 
endoglin (S-Endo) or L-endoglin (L-Endo) clones in pUC13 
and cDNA from an endothelial cell library, were used for 
PCR amplification in the presence of oligonucleotides #14 

20 and #15 specific for S-endoglin (panel A), or 
oligonucleotides #12 and #11 common to both L-endoglin and 
S-endoglin (panel B) . No amplification was observed when 
the RT reaction was omitted, excluding the possibility of 
DNA contaminating the RNA samples. Additional bands below 

25 the specific amplified fragments of L and S endoglin 
probably represent primer-dimer artifacts. 

Figure 11 shows that both S-endoglin and L- 
endoglin bind TGF-fil. Confluent cultures of S-endoglin (L+- 
S) and L-endoglin (L+-L) transf ectants were affinity 

30 labeled by incubation with 100 pM 125 I-TGF-fll alone or in the 
presence of 4 nM unlabeled TGF-J51 , followed by chemical 
crosslinking with disuccinimidyl suberate. All samples 
bound specifically to TGF-fll, as revealed by a fourfold 
ratio between the cpm bound in th absence versus presence 

35 of cold competing ligand; the parental L cells and mock 
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transfectants bound on average 60,000 specific cpm, while 
the endoglin transfectants bound on averag 110,000 
specific cpm. Immunoprecipitates of the L+-S transfectant 
with 44G4-IgG Sepharose contained 7700 cpm on averag 
5 versus 700 cpm with control IgG-Sepharose; immuno- 
precipitates of the L+-L transfectant contained 4300 cpm 
versus 630 cpm for control. These immunoprecipitates wer 
run on a 6-9% acrylamide gradient gel in reducing (R) or 
nonreducing (NR) conditions. The positions of the endoglin 
10 monomer, dimer and oligomer and of molecular weight markers 
are indicated. 

Figure 12 shows the multiple cloning sites of 
pcDNAI/Neo. 



15 



Figure 13 shows the amino acid and nucleotide 
seguence of "L-Endoglin. " 
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TVFTATLED DESCRIPTION OF THE INVENTION 



Endoglin is a homodimeric membrane glycoprotein 
composed of disulfide- linked subunits. Human-derived 
endoglin has been shown to exist in at least two isoforms 
5 expressed from a human cDNA library, an "L-isofonn," of 
about 90 kDa or an -S-isoform" of about 85 kDa, reduced. 
Human endoglin, purified from tissue is shown to be 
composed of two disulf ide-linked subunits each of about 95 
kDa. It is expressed in human pre-erythroblasts, 

10 macrophages, leukemic cells of the lymphoid and myeloid 
lineages and at higher levels in vascular endothelial 
cells. It is also abundant on the syncytiotrophoblast , the 
multinucleated placental layer which constitutes th 
interface with maternal blood and plays an important role 

15 in providing nutrient exchange and immunological protection 
of the fetus (Gougos et al., Tnter. Immunol. 4:83-92 
(1992)). 

Endoglin was first identified on a pre-B leukemic 
cell line, by its reactivity with mAb 44G4 (Quackenbush and 

20 Letarte, .T. Immunol . 134:1276-1285 (1985)). It is present 
at low levels on cells derived from childhood acute 
leukemia cases (ALL) with pre-B lymphoid and myeloid 
phenotype; it is absent from T-ALL (Gougos and Letarte, i. 
Immunol. 141:1925-1933 (1988b); Kreindler et al., Leukemia 

25 and Lymphoma 3:7-18 (1990)). Interestingly, the human 
endoglin gene is localized to chromosome 9q34-qter, likely 
within the region translocated to chromosome 22 in 
Philadelphia chromosome-positive leukemia (Fernandez-Ruiz 
et al., Sylsasai r. e ll Gen. 64:204-207 (1993)). In normal 

30 adult bone marrow, only 3-5% of mononuclear cells express 
endoglin and they bear a pro-erythroblast phenotype; pre-B 
and myeloid precursors do not show detectable levels f 
endoglin (Buhring et al., Leukemia 5:841-847 (1991)). 
Normal B and T lymphocytes and unstimulated monocytes do 

35 not express endoglin but an up-regulation is observed on 
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activated macrophages (Lastres et al., Eur. J. Immunol . 
22:393-397 (1992)). 

Endoglin expression is considerably increased in 
the endothelium of various pathological skin lesions where 
5 endothelial cell proliferation is known to occur (Westphal 
et al., J. Invest. Dermatol. 100:27-34 (1993)). In tumors, 
capillary endothelial cells undergoing active angiogenesis 
also show higher levels of endoglin than resting 
endothelium in adjacent tissue. 

10 Purified human endoglin exists in various 

isoforms and is composed of two disulfide linked subunits 
of Mr=95,000; 90,000; or 85,000, and bears N- and 0-linked 
oligosaccharides. The primary sequence of human endoglin 
is composed of a 25 amino acid signal sequence, an 

15 extracellular domain of about 561 amino acids, a single 
transmembrane region of 25 amino acids and a cytoplasmic 
tail of 14 to 47 amino acid residues (Gougos and Letarte, 
J. Biol. Chem. 265:8361-8364 (1990)). 

A relationship between human endoglin and the 
20 TGF-/3 receptor system was discovered with the molecular 
cloning of the rat TGF-/3-binding proteoglycan, betaglycan 
(also known as the type III TGF-/3 receptor), which revealed 
that the transmembrane domain and the relatively short (43 
amino acid) cytoplasmic tail of this protein wer 
25 remarkably similar (71% amino acid sequence similarity and 
63% amino acid identity) to the corresponding regions in 
endoglin (see Figure 1). The extracellular domains of 
these two proteins show limited homology in primary 
structure, and while endoglin is not a proteoglycan, it 
30 does contain N- and 0-linked oligosaccharides. 



Cloning of the TGF-B receptor II (Lin et al., 
Cell 68:775-785 (1992)) revealed a functional transmembrane 
serine /threonine kinase which had previously been 
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identified as a polyp ptid . of 80 kd, when bound and 
• chemically cross-linked to » 5 I-TGF-B (Cheifetz et al., 1± 
Biol. Chem. 265:20533-20538 (1990); Cheifetz and Massague, 
.t. Biol. Chem. 266:20767-20772 (1991); Massague, J., Sell 
5 69:1067-1070 (1992)). This receptor associates with the 
type I receptor (a 53 kd protein) to transduce the TGF-B 
signals; it is proposed that receptor I needs receptor II 
to bind TGF-B and that receptor II needs receptor I to 
mediate a signal (Laiho et al., J. Biol. Chem. 266:9108- 

10 9112 (1991); Laiho et al., .t. Biol. Chem. 265:18518-18524 
(1990); Wrana et al., Cell 71:1003-1014 (1992)). 
Transfection experiments with a recently cloned receptor I, 
also a serine/threonine kinase, further supports the view 
that receptor I must associate with receptor II to bind 

15 TGF-B (Ebner et al.. Science 260:1344-1348 (1993)). 

The present invention provides a soluble 
endoglin-derived human polypeptide that binds TGF-B. The 
full-length soluble endoglin-derived polypeptide comprises 
a signal sequence that is cleaved during processing, the 

20 561 amino acids of the extracellular domain of the mature 
endoglin polypeptide, an integral membrane protein, which 
consists of about 600 or 633 amino acids in total. Nucleic 
acid sequences encoding the human endoglin polypeptide are 
identified in Table 1 and Figure 13. The nucleic acid 

25 sequences encoding the soluble endoglin-derived 
polypeptides are included within the sequences set forth in 
Table 1 (from about amino acid number 1 to about amino acid 
number 561) and Figure 13. Also provided by this invention 
is a vector having inserted therein the genomic DNA 

30 molecule encoding endoglin. This vector was deposited 
under the provisions of the Budapest Treaty on the 
International Recognition of the Deposit of Microorganisms 
for the Purposes of Patent Procedure, with Coleccion 
Espanola de Cultivos Tipo (CECT), 46100 Burjasot 

35 (Valencia), Spain on October 21, 1993, under CECT 4475. 
Accordingly, isolated genomic DNA encoding endoglin is 
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within the scope of this invention. 

This invention also provides purified human 
endoglin polypeptides encoding two isofonns differing from 
each other in the cytoplasmic region. The "S-endoglin" has 
5 14 amino acid residues in the cytoplasmic region and th 
»L-endoglin" has 47 amino acid residues in the cytoplasmic 
region. The 586 amino acids spanning the extracellular and 
transmembrane regions in the mature endoglin are identical. 

As used herein, the term "purified" means that 

10 the molecule or compound is substantially free of 
contaminants normally associated with a native or natural 
environment. For example, the mature human proteins can be 
obtained from a number of methods. The methods available 
for the purification of membrane proteins include 

15 precipitation, gel filtration, ion-exchange, reverse-phase, 
and affinity chromatography. Other well-known methods are 

described in Deutscher et al., Guide to Protein 

Purification* Methods in Enz vmoloav Vol. 182, (Academic 
Press 1990), which is incorporated herein by reference. 

20 Alternatively, a purified polypeptide of the present 
invention can also be obtained by well-known recombinant 
methods as described, for example, in Sambrook et al., 
Molecular Cloning; A T.aboratorv Manual 2d ed. (Cold Spring 
Harbor Laboratory 1989), also incorporated herein by 

25 reference. An example of this means for preparing soluble 
endoglin-derived polypeptide is to express nucleic acid 
encoding the soluble endoglin in a suitable host cell, such 
as a bacterial, yeast or mammalian cell, using methods well 
known in the art, and recovering the expressed soluble 

30 protein, again using methods well known in the art. 

For the purpose of illustration, expression of 
soluble endoglin was achieved by excising a full length 2 . 3 
kb endoglin cDNA fragment from pcEXV-L ENDO, Hind III 
linkers were added and, after digestion with Hind III, it 
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was subcloned into the multiple cloning site of pBluescript 
vector (Stratagene) , followed by digestion with Tth 1111. 
(This enzyme cuts the coding region approximately 80 bp 
upstream of the transmembrane region.) 

5 Synthetic, complementary oligomers were 

engineered to contain an in- frame stop codon and a Bam HI 
overhang, and were ligated to the linearized plasmid. 
After digestion with Hind III, and Bam HI, and purification 
of the 1.7 kb fragment, it was ligated to pcDNAI/Neo 

10 (Invitrogen, San Diego, Figure 12). After cloning, the 
clones were screened by restriction enzyme analysis for th 
correct orientation and were found to be in the correct 
orientation. Verification of the cDNA insert was done 
using T7 and SP6 sequencing primers to confirm the presence 

15 of the start and stop codons at the 5' and 3' ends of th 
insert. 

5' End Primer 

TAATACGACTCACTATAG GGAGACCCAAGCTTGGGGAATTCCGTGGACAGCATS 

T7 PRIMER HINDIII ECORI Initiation 

20 3' End Primer 

AAGACCGTCTAGACGGATCCACTAG CTATAGTGTCACCTAAATG 

BAMHI SP6 PRIMER 

To express the construct, CHO-kl cells (ATCC) 
were transfected with pCDNA/Neo to determine optimal 

25 conditions required to generate stable transf ectants 
resistant to G418. Five (5) x 10* cells electroplated with 
5 jig of DNA at a voltage of 300 volts and a time constant 
of 17.9 msec, using the 960 uF capacitator (Biorad Gene 
Pulse) yielded the most number of stable transf ectants with 

30 minimal cell death. 

The following flow chart illustrates a m ans to 
express nucl ic acid encoding soluble endoglin. 
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SOLUBLE ENDOGLIK 



plasmid pcEXV-L ENDO 
Eco Rl digest 



Isolate 2.3 kb fragment 
Add Hind III linker & Hind III digest 



Subclone into pBluescript 
MCS-Hind III 
& selection on X-gal 



pBluescript-L Endo 
Digest with Tth 1111 to linearize 



Add complementary oligomers creating 
2 5 stop codon & Bam HI overhang 

TCTAGACG 
AGATCTGCCTAG 



30 



35 



Hind III Digest 

Isolate 1.7 kb Hind III-BamHl fragment 

Ligate with dephosphorylated 
Hind III-Bam Hi fragment of vector 
pcDNAI/Neo 



Transform E. coli MC1061/P3 
selection on Tetracyclin & Ampicillin 
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The solubl polypeptide and biologically active 
fragments thereof can also be produced by chemical 
synthesis. Synthetic polypeptides can be produced using 
Applied Biosystems, Inc. Model 430A or 431A automatic 
polypeptide synthesizer and chemistry provided by the 
manufacturer. The soluble polypeptide can also be isolat d 
directly from cells which have been transformed with the 
expression vectors described below in more detail. 

As used herein, endoglin-derived polypeptide 
means a human polypeptide having the amino acid sequence 
substantially the same as the 633 amino acid sequence shown 
in Table 1, or the 658 amino acid sequence shown in Figure 
13, or an active fragment thereof. As used herein the term 
-soluble endoglin-derived polypeptide" refers to a soluble, 
biologically active fragment of the human endoglxn 
polypeptide expressed by the extracellular domain of the 
nucleic acid. As used herein, an -active fragment- or 
-biologically-active fragment" refers to any portion of an 
endoglin polypeptide that binds to TGF-fl. Methods of 
20 determining whether a polypeptide can bind TGF-fl are well 
known to those of skill in the art, for example, as set 
forth herein. 

The invention also encompasses nucleic acid 
molecules which differ from that of the nucleic acid 

25 molecule shown in Table 1, e.g., the sequence shown in 
Figure 13, but which produce the same phenotypic effect. 
These altered, but phenotypic ally equivalent nucleic acid 
molecules are referred to as -equivalent nucleic acids." 
This invention also encompasses nucleic acid molecul s 

30 characterized by changes in non-coding regions that do not 
alter the phenotype of the polypeptide produced therefrom 
when compared to the nucleic acid molecule described above. 
This invention further encompasses nucleic acid molecules 
which hybridize to the nucleic acid molecule of the subject 

35 invention or its complem nt. As used herein, the term 
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"nucleic acid" encompasses mRNA and cRNA as well as singl 
and double-stranded genomic DNA, DHA and cDNA. In 
addition, as used herein, the term "polypeptide" 
encompasses any naturally occurring allelic variant 
5 thereof, such as S-endoglin and L-endoglin, as well as man- 
made recombinant forms. 

This invention provides an isolated nucleic acid 
molecule encoding a soluble ehdoglin-derived polypeptide. 
As used herein, the term "isolated nucleic acid molecule" 

10 means a nucleic acid molecule that is in a form that does 
not occur in nature. One means of isolating a human 
endoglin nucleic acid is to probe a human cDNA expression 
library with a natural or artificially designed antibody t 
endoglin, using methods well known in the art (see Gougos, 

15 A. et al., J. Biol Chem. 265:8361 (1990)) and the Examples 
set forth below. DNA and cDNA molecules which encode human 
endoglin polypeptides can be used to obtain complementary 
genomic DNA, cDNA or RNA from human, mammalian or other 
animal sources. Isolated genomic DNA also is encompassed 

20 by this invention as described above. It is isolated by 
using the nucleic acid sequences of this invention and 
methods well known to those of skill in the art as 
described in Sambrook et al., supra. 

The invention further provides an isolated 
25 nucleic acid molecule operatively linked to a promoter of 
RNA transcription, as well as other regulatory sequences. 
As used herein, the term "operatively linked" means 
positioned in such a manner that the promoter will direct 
the transcription of RNA off of the nucleic acid molecule. 
30 Examples of such promoters are SP6, T4 and T7. Vectors 
which contain both a promoter and a cloning site into which 
an inserted piece of DNA is operatively linked to that 
promoter are well known in the art. Preferable, these 
vectors are capable of transcribing RNA in vjtro or in 
35 vivo . Examples of such vectors are the pGEM series 
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This invention provides an expression or 
replication vector comprising this isolated nucleic acid 
molecule such as DNA, cDNA or RNA encoding a soluble 
5 endoglin-derived polypeptide. Examples of vectors are 
viruses, such as bacteriophages, baculoviruses and 
retroviruses, cosmids, plasmids (such as pcEXV-2) and other 
recombination vectors. Nucleic acid molecules are inserted 
into vector genomes by methods well known in the art. For 
10 example, insert and vector DNA can both be exposed to a 
restriction enzyme to create complementary ends on both 
molecules that base pair with each other and which are then 
joined together with a ligase. Alternatively, synthetic 
nucleic acid linkers can be ligated to the insert DMA that 
15 correspond to a restriction site in the vector DMA, which 
is then digested with a restriction enzyme that recognizes 
a particular nucleotide sequence. Additionally, an 
oligonucleotide containing a termination codon and an 
appropriate restriction site can be ligated for insertion 
20 into a vector containing, for example, some or all of the 
following: a selectable marker gene, such as neomycin gene 
for selection of stable or transient transf ectants in 
mammalian cells; enhancer /promoter sequences from the 
immediate early gene of human CMV for high levels of 
25 transcription; transcription termination and RNA processing 
signals from SV40 for mRNA stability; SV40 polyoma origins 
of replication and ColEl for proper episomal replication; 
versatile multiple cloning sites; and T7 and SP6 RNA 
promoters for in yit£o transcription of sense and anti- 
sense RNA. Other means are available. 

Also provided are vectors comprising a DNA 
molecule encoding an endoglin-derived polypeptide, or 
soluble fragment thereof, adapted for expression m a 
bacterial cell, a yeast cell, a mammalian cell and other 
animal cells. The vectors additionally comprise the 
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regulatory elements necessary for expression of the DNA in 
the bacterial, yeast, mammalian or animal cells so located 
relative to the DNA encoding soluble endoglin polypeptide 
as to permit expression thereof. Regulatory elements 
5 required for expression include promoter sequences to bind 
RNA polymerase and transcription initiation sequences for 
ribosome binding. For example, a bacterial expression 
vector includes a promoter such as the lac promoter and for 
transcription initiation the Shine-Dalgarno sequence and 

10 the start codon AUG (Sambrook et al. supra. 1989). 
Similarly, a eucaryotic expression vector includes a 
heterologous or homologous promoter for RNA polymerase II, 
a downstream polyadenylation signal, the start codon AUG, 
and a termination codon for detachment of the ribosome. 

15 Such vectors can be obtained commercially or assembled by 
the sequences described in methods well known in the art, 
for example the methods described above for constructing 
vectors in general. Expression vectors are useful to 
produce cells that express the polypeptide. 



20 



This invention provides a mammalian cell 
containing a cDNA molecule encoding an endoglin-deriv d 
polypeptide or a soluble fragment thereof. An example is 
a mammalian cell comprising a plasmid adapted for 
expression in a mammalian cell. The plasmid has a cDNA 
25 molecule encoding an endoglin-derived polypeptide and the 
regulatory elements necessary for expression of the 
polypeptide. Various mammalian cells may be utilized as 
hosts, including, for example, mouse fibroblast cell 
NIH3T3, CHO cells, HeLa cells, Ltk- cells, etc. Expression 
plasmids such as those described supra can be used to 
transfect mammalian cells by methods well known in the art 
such as calcium phosphate precipitation, DEAE-dextran , 
electroporation or microinjection. 



30 



This invention provides a pharmaceutical 
35 composition containing a pharmaceutical carrier and any of 
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a purified polypeptide, a purifi d soluble polypeptide, an 
active fragment thereof, or a purified, mature protein and 
active fragments thereof, alone or in combination with each 
other. These polypeptides or proteins can be recombinantly 
5 derived, chemically synthesized or purified from nativ 
sources. As used herein, the term -pharmaceutically 
acceptable carrier" encompasses any of the standard 
pharmaceutical carriers, such as a phosphate buffered 
saline solution, water and emulsions such as an oil/water 
10 or water/oil emulsion, and various types of wetting agents. 
Any of these pharmaceutical compositions are useful in the 
methods described below or for the preparation of a 
medicament for treating the conditions described below. 

Also provided are antibodies having specific 

15 reactivity with the endoglin-derived TGF-fl-binding 
polypeptides of the subject invention, such as anti- 
endoglin antibody 44G4 (Quackenbush, E.J., and Letarte, 
M.J., J. Immunol. 134:1276-1285 (1985)) or any antibody 
having specific reactivity to a TGF-/3-binding endoglin 

20 polypeptide. Active fragments of antibodies are 

encompassed within the definition of -antibody." The 
antibodies and fragments of the invention can be produced 
by any method known in the art. For example, polyclonal 
and monoclonal antibodies can be produced by methods well 

25 known in the art, as described, for example, in Harlow and 
Lane, a^ihnrfies: & T. a hnr a torv Manual (Cold Spring Harbor 
Laboratory 1988), which is incorporated herein by 
reference. The polypeptide can be used as the immunogen in 
generating such antibodies. Altered antibodies, such as 

30 chimeric, humanized, CDR-grafted or bifunctional antibodies 
can also be produced by methods well known to those skilled 
in the art. Such antibodies can also be produced by 
hybridoma, chemical synthesis or recombinant methods 
described,* for example, in Sambrook et ml., supra . The 

35 antibodies can be used for determining the presence or 
purification of the endoglin-derived polypeptide or soluble 
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fragment ther of, of the present invention. With respect 
to the detecting of such polypeptides, the antibodies can 
be used for in vitro diagnostic methods to determxne the 
presence of endoglin or in yiy£ imaging methods. 

Immunological procedures useful for in vitro 
detection of the target soluble endoglin-derived 
polypeptide in a sample include immunoassays that employ a 
detectable antibody. Such immunoassays include, for 
example, ELISA, Pandex microf luorimetric assay, 
agglutination assays, radioimmunoassays, flow cytometry, 
serum diagnostic assays and immunohistochemical staining 
procedures which are well known in the art. An antibody 
can be made detectable by various means well known in the 
art. For example, a detectable marker can be directly or 
indirectly attached to the antibody. Useful markers 
include, for example, radionuclides, enzymes, fluorogens, 
chromogens and chemiluminescent labels. 

This invention provides a method of modifying a 
biological function mediated by the regulatory activity of 
20 TGF-A which comprises contacting a suitable sample 
containing TGF-A with an effective amount of a biologically 
active endoglin-derived polypeptide, for example soluble 
endoglin, or a pharmaceutical composition described above. 

As used herein, "an effective amount" refers to 
25 an amount of the polypeptide sufficient to bind to TGF-fl 
and thereby prevent or inhibit its regulatory activity. 
This method is especially useful for modifying the 
regulatory activity of TGF-JS1 or TGF-B3 . Examples of 
regulatory activities include, but are not limited to 
stimulation of cell proliferation, cell growth inhibition, 
promotion of extracellular matrix proteins, and regulation 
of immune functions. TGF-fl is known to be a potent 
chemoattractant for monocytes and can induce IL-1, TNF-a, 
TGF-fl and surface Fc Y RIH, all of which are involved in the 
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inflammat ry response. Conversely, TGF-B can deactivate 
macrophages by inhibiting the antimicrobial activity and 
the superoxide anion generation, and induce suppression of 
class Il-restricted Ag presentation by macrophages. 

5 The method can be practiced in yi£rs or in vivo . 

If the method is practiced in vitro, contacting is effected 
by incubating the sample with a polypeptide, a protein or 
a pharmaceutical composition as described above. 

in vitro the novel nucleic acid molecules and 
10 antibodies of this invention are useful to detect and 
quantify the amount of TGF-B in a sample isolated from a 
subject, such as a human patient. The detection of TGF-B 
is useful to monitor the progression of a disease related 
to overexpression of TGF-B, e.g., glomerulonephritis. 

15 However, in a preferred embodiment the contacting 

is effected is Yiyo by administering a polypeptide, a 
protein or a pharmaceutical composition, as describ d 
above, to a subject, e.g., a human patient. 

Methods of administration are well known to those 
20 of skill in the art and include, but are not limited to 

administration orally, intravenously or parenterally . 

Administration will be in such a dosage such that the 

regulatory activity is effectively modified. 

Administration can be effected continuously or 
25 intermittently such that this amount is effective for its 

intended purpose. 

This invention also provides a method of treating 
a pathologic condition caused by a TGF-B-regulated activity 
comprising contacting the TGF-B with any of a purified 
30 soluble endoglin-derived polypeptide, an active fragment 
thereof, an endoglin-derived polypeptide or an active 
fragment ther of. The TGF-B is bound with said polypeptide 



WOW/10187 PCT/US93/10307 

24 

to thereby treat the pathologic condition mediated by TGF-B 
regulatory activity. As used herein, "pathologic 
conditions" refers to any pathology arising from TGF-B- 
induced regulatory activity, for example, inflammation, 
5 rheumatoid arthritis, inflamed skin lesions, scar tissue 
formation, lung fibrosis, liver fibrosis, atherosclerosis, 
and glomerulonephritis. Growth and proliferation of 
mesenchymal cells is stimulated by TGF-B, however som 
tumor cells may also be stimulated thus using TGF-fl as an 
autocrine growth factor. An example of inhibitory 
conditions are the prevention of new cell growth to assist 
in repair of tissue damage, for ulceration and 
immunosuppression. The stimulation of extracellular matrix 
production by TGF-fl is essential for wound healing. 
However, in some cases, the TGF-fl response is uncontrolled 
and an excessive accumulation of extracellular matrix 
results. An example of excessive accumulation f 
extracellular matrix is glomerulonephritis. An additional 
example of a pathology is cancer. 
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In one embodiment, the method is practiced by 
administering to a subject, e.g., a human patient or a 
mammal, an effective amount of a purified endoglin protein 
or an endoglin-derived soluble polypeptide or a 
biologically active fragment thereof , or the pharmaceutical 
25 composition described above. Methods of administration are 
outlined supra . 



Also provided by this invention is a method f 
inhibiting the activity of endoglin by contacting endoglin 
with an effective amount of a polypeptide capable of 

30 binding to endoglin to bind endoglin, thereby inhibiting 
the activity of endoglin. As used herein, the term "a 
polypeptide capable of binding to endoglin" means any 
substance capable of forming a complex with endoglin, for 
example, TGF-fl 1 or TGF-fl 3, or an active fragment thereof. 

35 An active fragment is an amino acid sequence corresponding 
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to a fragment of TGF-fll or TGF-B3 that retains th ability 
to bind endoglin. Methods of making such fragm nts are 
well known to those of skill in the art as are methods of 
determining the binding activity of the fragments. Also 
encompassed by this invention are polypeptides that retaxn 
their activity to bind to endoglin, but no longer medxate 
the biological response corresponding to the binding of a 
functional ligand to the receptor is destroyed. Thus, 
these -mutated" polypeptides can act as antagonists to the 
biological function mediated by the ligand to endoglxn by 
blocking the binding of normal, functioning ligand. to 
endoglin on the cell* 

This invention also encompasses the use of the 
compositions defined above for the preparation of 
medicaments to modify a biological function regulated by 
TGF-B . These biological functions are described above xn 
detail. 

It is understood that modifications which do not 
substantially affect the activity of the various molecul s 
of this invention are also included within the def initxon 
of said molecules. 

The following examples are intended to illustrate 
but not limit the present invention. 

A. ISOLATION OF HUMAN ENDOGLIN PROTEIN 
AND NUCLEIC ACID ENCODING SAME 

EXAMPLE I 
CeU ^Hnre and Transf ections 

Human umbilical vein endothelial cells (HUVEC, 
CRL 1730, ATCC) were maintained in a-minxmal essential 
30 media supplemented according to supplier's instructions or 
prepared from umbilical veins as previously described 
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(Gougos, A. et al., J. Immunol. 141:1925 (1988)). Similar 
results were obtained using cells from either source. COS- 
M6 cells, maintained in Dulbecco's modified Eagle's medium 
supplemented with 10% bovine serum, were transfected with 
5 a cDNA encoding full-length endoglin ligated into the EcoRI 
site of the mammalian expression vector pcEXV (Miller, J. 
et al., Jf- Evo. Med. 164:1478 (1986)) or with a control 
vector without cDNA insert (pcMV5; Lopez-Casillas, F. et 
al., Cell 67:785 (1991)) by the DEAE-dextran-chloroquine 
10 procedure (Seed, B. , et al., p.n.a.S. USA 84:3365 (1987)). 
Twenty- four (24) hours post-trans fection, cells were 
trypsinized and reseeded into multicluster dishes and 
allowed to grow an additional 48 hours before being 
affinity-labeled with " S I-TGF-/31 as described below. 

15 EXAMPLE II 

Receptor Affinity Labelin g and Inmmnoprecipitation 

TGF-/31 and TGF-/32 were purchased from R&D 
Systems (Minneapolis, MN) and TGF-/33 was obtained from 
Oncogene Science (Manhassett, NY). " S I-TGF-01 used in these 

20 studies was prepared by the chloramine-T method as 
previously described (Cheifetz, S. et al., J. Biol. Chem. 
265:20533 (1990)) or purchased from Amersham Corp.; both 
preparations gave identical results. The conditions for 
affinity labeling cell monolayers with " 5 I-TGF-pl and 

25 disuccinimidyl suberate (Pierce Chemical Co.) have been 
described previously (Massague, J., Methods — Enzymol . 
146:174 (1987)). The concentrations of "*I-TGF-01 and 
competing unlabeled ligands used for each experiment are 
indicted in the figure legends. Triton X-100 extracts of 

30 the affinity- labeled cells were either analyzed directly on 
sodium dodecyl sulf ate-polyacrylamide gels (SDS-PAGE) or 
first incubated with monoclonal antibody (mAb) 44G4 
directed against human endoglin (Quackenbush, E.J. et al., 
J. Immunol. 134:1276 (1985)) or with control antibody (see 

35 below). For immunoprecipitations , detergent extracts were 
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diluted with an equal volume of phosphate-buffered salin 
containing 1% Triton X-100 and precleared by incubation for 
20 min at 4»C with protein G-Sepharose (Pharmacia LKB 
Biotechnology Inc.) prior to overnight incubation at 4- C 
5 with mAb 44G4. Immune complexes were collected by 
incubation with protein G-Sepharose for 1 hour at 4« C. 
For some experiments, mAb 44G4 was used coupled to 
Sepharose. The immunoprecipitates were washed three times 
(saline with 1% Triton X-100) and then resolved by SDS-PAGE 
10 in the presence or absence of dithiothreitol (DTT) and 
visualized by autoradiography. Irrelevant mAb (44D7) us d 
in control experiments to monitor specificity of th 
immunoprecipitations did not immunoprecipitate any 
affinity-labeled bands. 

15 EXAMPLE III 

snfi-PAGE an H ?n-Gel Analysis 

Analysis of the affinity-labeled profile of HUVEC 
revealed that, like vascular endothelial cells from other 
sources, these cells have little or no betaglycan, which 

20 characteristically migrates as a diffuse band between 200 
and 400 KDa on reducing SDS-PAGE (Figure 2A) . Instead, 
HUVEC expressed a disulf ide-linked cell surface protein 
that, together with TGF-/3 receptors I and II, was affinity- 
labeled by crosslinking with »*I-TGF-/31. Receptors I and 

25 II were detected in HUVEC as labeled complexes of 
approximately 65 KDa and 100 KDa, respectively, which is 
similar to the size of these labeled receptors reported for 
other human cell lines. Comparison of the relative 
migration of the affinity-labeled proteins fractioned on 

30 SDS-PAGE revealed that the major affinity-labeled proteins 
of HUVEC migrated between 95-120 KDa on reducing gels 
whereas on non-reducing gels the major affinity-labeled 
proteins migrated between 100-110 KDa (presumed to be 
receptor II) and at 180 KDa and above (endoglin) (Figure 

35 2). This patt rn indicated the presence of disulfide- 
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linked TGF-/3-binding proteins. 

Resolution of these disulfide-linked TGF-fll 
binding proteins on two-dimensional gels (Figure 2B) 
confirmed that the disulfide-linked complexes (probably 
5 dimers and higher order oligomers) contained subunits of 
approximately 95 KDa (value estimated by subtracting the 
cross-linked TGF-/31 monomer mass 12.5 KDa from the reduced 
110 KDa affinity-labeled complex). Together with the typ 
II receptor, the disulfide-linked TGF-fll-binding proteins 
10 are the major affinity-labeled species expressed by HUVEC. 

EXAMPLE IV 

Tnmrnnoor^pitatiP T. with ant i -endoqljn mAb 

To determine whether the disulfide-linked TGF-B- 
binding protein on endothelial cells was endoglin, 

15 affinity-labeled HUVEC extracts were immunoprecipitated 
with monoclonal antibody (mAb) 44G4 which is specific for 
human endoglin (Georgi, L.L. et al., Cell 61:635 (1990); 
MacKay, K. et al., t. Chem. 266:9907 (1992); Merwin, 
j.r. et al., *™. J« Pathol. 138:37 (1991)). 

20 Electrophoretic analysis of these immunoprecipitates 
revealed a labeled protein complex whose subunit structure 
was similar to that of endoglin (Figure 3A) . Thus, under 
reducing conditions, a major affinity- labeled band of 
approximately 110 KDa was seen which migrated as complexes 

25 of 180 KDa and greater than 200 KDa when analyzed under 
non-reducing conditions. The higher order oligomers might 
contain multiple endoglin molecules crosslinked by TGF-fll, 
itself a disulfide-linked dimer. Repeated immuno- 
precipitation with 44G4-IgG-Sepharose completely depleted 

30 these labeled species from cell extracts (Figure 3B). No 
affinity-labeled bands were immunoprecipitated from three 
other human cell lines (A549, Hep G2, MCF-7), which lack 
endoglin and were used as negative-controls for these 
experiments. Monoclonal antibodies specific to human 
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endoglin thus demonstrate that endoglin is a major TGF-p- 
binding protein in human vascular endothelial cells. 

EXAMPLE V 

Ectoeic Bjcpression, o f Kndoalin in Cells 

5 The identity of this dimeric TGF-0-binding 

protein of HUVEC with endoglin was confirmed by ectopically 
expressing the full-length endoglin cDNA in COS monkey 
kidney cells. After affinity-labeling with » 5 I-TGF-/31, a 
labeled species with the characteristics of endoglin could 
10 be specifically precipitated by mAb 44G4 only from the 
detergent extracts of endoglin transf ectants (Figure 4). 
Differences in glycosylation likely account for the smaller 
size of endoglin expressed in COS cells relative to 
endogenous endoglin of HUVEC. 



15 



B. ISOLATION OF S- AND L-ENDOGLIH 

EXAMPLE VI 
^n«i rinnino and Seou^^^na of Endoglin 

Approximately 2.5 X 10 s clones from a XgtlO cDNA 
library prepared from PMA-treated myelomonocytic human cell 

20 line HL60 (Corbi, A.L. et al. , BMBO J- 6:4023 (1987)). 
(50,000 pfu/150 mm dish) were screened with 700 bp PstI 
fragment from endoglin cDNA (Gougos, A. and Letarte, M.J., 
■t. Riol. Chem. 265:8361 (1990)). Thirteen hybridizing 
clones were isolated after three rounds of plaque 

25 purification with sizes ranging from 2.4-3.1 kb. The 
longest clone (clone 3.3; 3073 bp) was subcloned into P UC13 
and sequenced using the dideoxy chain termination method 
(Sanger, F. et al., PNASJJSA. 74:5463 (1977)). 



30 



Clone 3.3 in pOC13 was digested with BbrPI and 
BamHI. Endoglin fragment was made blunt and inserted into 
the mammalian expression v ctor pcEXV (Mill r, J. and 
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Germain, R. , J. Exo. Med. 164:1478 (1986)), yielding pcEXV- 
EndoS. The lack of leader sequence in the cDNA (Table I) 
(Gougos, A. and Letarte, M.J., J. Biol. Chem. 265:8361 
(1990) was overcome by the construction of pcEXV-EndoL. 
5 pcEXV-EndoS was digested with Mlul/BamHI and ligated to the 
563 bp Mlul/BamHI fragment specific to endoglin cDNA (Table 
I), resulting in pcEXV-EndoL. Transfectants were generated 
by cotransf ection of either pcEXV-EndoS or pcEXV-EndoL with 
psV2neo into mouse L cells. After G418 selection, 
10 endoglin-positive clones were isolated. In the same 
experiment, endoglin-negative clones were selected as mock 
transfectants. 

EXAMPLE VII 

Antibodies . Immunofluoresc ence . Immunoprecipitations 
15 and Immunoblottinq 

Endoglin- specif ic monoclonal antibodies used were 
8E11 (Lastres, P. et al., F"t- Immunol. 22 : 393 ( 1992 ) ) 
of the IgM class and 44G4 (Gougos, A. and Letarte, M. , Jj_ 
Immunol. 141:1925 (1988)) of the IgGl subclass. Control 

20 monoclonal antibodies were HC1/1 (anti-CDllc) of the IgGl 
subclass and X63. FCM and immunoprecipitation analyses 
were performed as described in Lastres, P. et al., E^r« J ^ 
Immunol . 22:393 (1992), incorporated herein by reference. 
For immunoblotting studies, 5 X 10* cells were lysed in 250 

25 pi of 1% Triton X-100, 1 mM PMSF in PBS for 30 minutes at 
4°C. Insoluble material was removed by centrifugation at 
100,000 x g for 1 hour in a Beckman TL100 centrifuge. 
Proteins contained in the supernatant were separated by 
SDS-PAGE under nonreducing conditions using a minigel 

30 system and transferred to nitrocellulose membranes (Bybond, 
Amersham) . Membranes were incubated first with a blocking 
solution (10% fetal calf serum, 0.5% Tween-20, 1 M glucose 
and 10% glycerol in PBS), followed by incubation with the 
monoclonal antibody 44G4. The presence of endoglin was 

35 rev aled using a chemiluminescence assay (ECL detection 
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kit, Amersham) . 

EXAMPLE VIII 



10 



15 



20 



25 



30 



Revers 



P Transc rii>i-Hr™-Polvine ™^ Chain Reaction (RT-PCR) 



Total cellular RNA was isolated using guanidinium 
isothiocyanate and a cesium chloride ultracentrif ugation 
step (Chirgwin, J.M. et al. f Biochemistry 18:5294 (1979)). 
Poly (A)+ RNA was purified by oligo (dT) affinity 
chromatography (Maniatis, T. et al. , "Molecular Cloning: 
A Laboratory Manual" Cold Spring Harbor Laboratory, Cold 
Spring Harbor (1982) . Single- stranded cDNA was synthesized 
from poly (A+) RNA (0.5 pg/40 pi reaction), using AMV 
reverse transcriptase. Five microliters of cDNA was used 
for a 50 jil PCR reaction (Gilliland, G. et al., Ini [Innis, 
M.A., Gelfand, D.H., Sninsky, J.J. and White, T.J. (Eds.)) 
-PCR Protocols" Academic Press, San Diego 1990, p. 60.). 
Oligonucleotide primers used for amplification were E#12 
(nucleotides .1945-1964), EIUR (reverse complement of 
nucleotides 2475-2494), E#14 (nucleotides 2136-2158), and 
E#15 (reverse complement of nucleotides 2247-2273). 
Amplifications were performed in 1 X Taq buffer (Promega) 
with 0.2 mM each dATP, dCTP, dGTP, dTTP at 0.25 pH 
concentration of each primer and 0.25 U/50 pi of Taq DMA 
polymerase (Promega). Amplification was carried out in a 
thermal cycler as follows: 5 minutes at 95'C; 35 cycles of 
45 seconds at 94«C, 45 seconds at 54«C, and 1 minute at 
72°C for oligonucleotide pair E#12-E#11R or 35 cycles of 45 
seconds at 94»C, 45 seconds at 68«C and 1 minute at 72«C 
for pair E#14-E#15 and then 10 minutes at 72°C. Contr 1 
reactions, identical except for the omission of reverse 
transcriptase, were performed simultaneously. 
Amplification products were analyzed by agarose gel 
electrophoresis and ethidium bromide staining. 
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EXAMPLE IX 
Beceotor Affinity Labeling 

TGF-B1 was purchased from WD Systems 
(Minneapolis, MN) and » S I-TGF-B1 was obtained from Amersham 
5 (Oakville, Canada) at a specific activity of 2000 Ci/mmol. 
Parental mouse L cells, transfectant mouse L cells 
expressing S-endoglin or L-endoglin, and mock transfectant 
L cells were grown at confluence (5 X 10* cells per plate) 
and incubated with 100 pM of » 5 I-TGF-B1 for 4 hours with and 

10 without 4 nM of competing unlabeled TGF-fll. Cells were 
washed and cross-linked at a final concentration of 0.16 mM 
disuccinimidyl suberate (DSS) (Pierce Chemical Co.) in a 
buffer containing 128 mM HaCl, 5 mM KC1, 5 mM MgS0 4 , 1.2 mM 
CaCl„ 50 mM Hepes, pH 7.5 for 15 minutes at 4«C. Cells 

15 were washed four times and solubilized directly on the 
petri dishes with a minimum of solubilization buffer 
containing 1% Triton X-100 and proteolytic inhibitors as 
described previously (Massague, J., Methods Enzymol. 
146:174 (1987)). The extracts were subjected to 

20 immunoprecipitation followed by SDS-PAGE analysis. 

RESULTS 

Tcnlntion and rharaeteri ^Hnn of Endogljn cDNA Clones; 
-Mrnvtif icaiHnn of a Cytoplasmic Variant 

Full-length cDKA clones were derived from a XgtlO 
25 library prepared from PMA-treated HL60 cells. Clone number 
3.3 was selected for further characterization due to the 
large size of the insert. The partial sequence of the 3073 
bp cDNA insert from clone 3.3 is depicted in Figure 7. 
This clone contains 281 bp of 5' untranslated region 
30 followed by an open reading frame of 1875 bp. The 
predicted protein sequence shows that the endoglin leader 
peptide contains 25 amino acids (aa) , followed by 561 
residu s at the extracellular portion and a transmembrane 



PCT/US93/10307 

WO 94/10187 

33 

•«« S nannina 25 amino acids as expected (Gougos, A. and 
VI Z e To ^US^ 265:8361 (1990,). However, 
Letarte M.J., ^ - within th 

, 3^7 (Figure 8). The first 21 nucleotides of the insert 
5 4 i fX with the preceding seguence and 

sequence of 7 amino acids. This ^ 

10 SSr^^i«- (Cougos, A. and -arte - 
0,^^ 265:8361 (1990,) (Figure 8, , suggest ing the 
existence of two alternative forms of endoglin. The 
existence onrioa i in in the myelomonocytxc cell 

predominant form of endoglw in tne y 

Line acid cytoplasmic tails, and ^YT'^esosctlvX 
are referred tc .. S-e»do,lin and L-endoglm. respectively. 



Fnrms of endoc 



(0 To characterise further the two ^"mative 

for™, independent constructs corresponding to the short 
and lone forms of endoglin were inserted rnto the 
and ions ions tta nsfected into sioose 

egression vector pcEXV and L-.ndoglin 
fibroblast. (Frgnre 9). Both, S e g determined by 

25 r;pit«z ie Tn\rt h L c :"et.bouo - * 

1 fectants followed by immunoprecipitation revealed a 
.,„, OT - a 160-kDa (S-endoglin) protein 

specifically 9 distin ct size of S- and L- 

30 antibody (Figure 9). The di ^ 
, nrt1 i n i n the transfectants was 
endoglin in analysi8 under n on-reducing conditions 
immunoblotting analysis unuc , th 

(Piour. 9). In these experiments, L-endoglrn showed the 
amf" a. the endoglin detected on U937 cells, .nggestxn, 
35 that this is the predominant form in th. pro-onocytrc cell 
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line. 

Cell surface radiolabeling of the transfectants 
followed by immunoprecipitation demonstrated that both 
isoforms are expressed as disulfide linked homodimers 
5 (Figure 9), indicating that cysteine residue present in the 
extracellular portion are mediating the interchain 
disulfide bond. 

Differential expr ession 
of S-endoglin and T.-endoolin mRNA 

10 The individual expression of S- and L-endoglin 

mRNA was analyzed by RT-PCR. Amplification with two 
primers derived from the unique 3' region of S-endoglin 
cDNA generated the expected 137 bp product on S-endoglin 
transfectants and on PMA-treated monocytic lines, 

15 endothelial cells and placenta (Figure 10), demonstrating 
the existence of this isoform in several cell types. When 
primers common to L-endoglin and S-endoglin were used, the 
L-endoglin specific fragment of 411 bp could be amplified 
on PMA-treated monocytic cell lines, placenta and 

20 endothelial cells, whereas the S-endoglin fragment of 546 
bp was only amplified on control S-endoglin transfectants 
(Figure 10). When competitive templates are amplified by 
PCR, an abundant template can suppress the amplification of 
a less abundant one (Gilliland, G. et al., Uki [Innis, 

25 M.A., Gelfand, D.H., Sninsky, J.J. and White, T.J. (Eds.)) 
"PCR Protocols", Academic Press, San Diego 1990, p. 60). 
Therefore, these results indicate that L-endoglin is the 
predominant form and S-endoglin is expressed at lower 
levels in these cell types. This is in agreement with the 

30 presence in PMA-treated U937 cells of an endoglin molecule 
of Mj similar to that observed on the trans fectant L- 
endoglin (Figure 9), assuming similar glycosylation levels 
on both cell types. 
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Igendg glia and S- *-"d oa lin bind to TGF-fil 

Since endoglin has been found to be a component 
of the receptor system for TGF-B, Cheifetz, S. et al., J. 
H4nl. Chem. 267:19027 (1992), it was of interest to analyze 

5 the ability of each isoform to bind this ligand. 
Transfectants expressing short and long forms of endoglin 
were compared to the parental L cells and to the mock 
transfectants for their ability to bind TGF-fl 1 . 
immunoprecipitation analysis allowed the identification of 

10 the TGF-Bl-endoglin complex (Figure 11). Under non- 
reducing conditions, the dimers of endoglin were seen as 
radiolabeled bands of M, 170 kDa (short form) and 175 kDa 
(long form). An oligomer of endoglin is seen in both cases 
migrating with a M, > 270 kDa. A similar complex was also 

15 observed previously in cross-linking experiments with human 
endothelial cells and might represent endoglin molecules 
cross-linked by TGF-fll, itself a dimer (Roberts, A.B. and 
Sporns, M.B. In: Sporn, M.B. and Roberts, A.B. (Eds.), 
-Peptide growth factors and their receptors" Springer- 

20 Verlag, Heidelberg 1990, p. 419). Under reducing 
conditions, major bands of 97 kDa and 107 kDa were seen 
with the short form of endoglin and bands of 102 kDa and 
112 kDa were observed with the long form. These doublets 
have been observed previously for endothelial cells and 

25 might represent endoglin bound to the monomer (12.5 kDa) or 
dimer (25 kDa) of TGF-fl 1 (Roberts, A.B. and Sporns, M.B. 
In.: Sporn, M.B. and Roberts, A.B. (Eds.), "Peptide growth 
factors and their receptors" Springer-Verlag , Heidelberg 
1990, p. 419.; Cheifetz, S. and Massague, J., J- P io ?-- 

30 Chem. 266:20767 (1991)). By subtracting the contribution 
of TGF-fl 1, one can estimate a molecular weight of 85 kDa 
for S-endoglin and 90 kDa for L-endpglin. 
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mSCUSSION 

Sequence analysis demonstrated the existence of 
two different cDNA variants named L-endoglin and S- 
endoglin. These two isoforms are coexpressed by myeloid 
5 cells, endothelium and placenta, although the majority of 
the transcripts synthesized apparently correspond to the L- 
endoglin isoform. The mechanism by which the two isoforms 
are generated remains to be determined. Most likely, both 
isoforms are generated by alternative splicing. In fact, 

10 consensus sequences of donor /acceptor sites (GT, AG) at the 
5 < and 3' ends and branch point of the lariat (CTGAC) , have 
been found on the novel 135 bp insert of S-endoglin cDNA 
(Figure 7). A siaiilar example of cytoplasmic variants 
generated by a -retained intron" splicing mechanism, have 

15 been recently reported for the activin receptor of the TGF- 
B receptor family (Attisano, L. et al., Ceil 68:97 (1992)). 

The behavior of the two endoglin isoforms was 
analyzed by transfection studies. Both forms are expressed 
on the cell surface as disulfide linked homodimers, 
20 indicating that the cysteine residues present in the 
extracellular region are responsible for the dimerization. 
in spite of the different cytoplasmic domains, both forms 
behave as TGF-B1 binding proteins. 

S-endoglin, L-endoglin and betaglycan contain 22 
25 identical residues in the transmembrane domain and the 
adjacent cytoplasmic region (Figure 9). This delineates a 
first conserved motif between betaglycan and the two forms 
of endoglin, which contains two tyrosine residues and two 
Ser/Thr residues whose phosphorylation status remain to be 
30 analyzed. A second region of high identity in the 
cytoplasmic tail is shared only by the long form of 
endoglin and betaglycan. The high content (40%) of Ser/Thr 
residues in this motif, and the absence of Tyr suggests 
that this region might undergo phosphorylation by a Ser/Thr 
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kinase. Interestingly, the cytoplasmic domain of the TGF-B 
receptor II, displays Ser/Thr kinase actxvxty (Lib. H.T. 
al., Cell 68:775 (1992)). 

Although the invention has been described with 
5 reference to the disclosed embodiments, it should _b 
understood that various modifications can be made without 
departing from the spirit of the invention. Accordingly, 
the invention is limited only by the following claims. 
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What is claimed is: 

1. An isolated nucleic acid molecule encoding 
an endoglin' polypeptide or a biologically active fragment 
thereof. 

2. A nucleic acid molecule encoding a soluble 
endoglin-derived polypeptide or a biologically active 
fragment thereof. 

3. The nucleic acid of claim 1 or 2, wherein 
the nucleic acid encodes a human endoglin polypeptide. 

4 . The nucleic acid molecule of claim 1 or 2 , 
wherein the nucleic acid is genomic DNA, cDKA, mRNA or 
cRNA. 

5. The nucleic acid molecule of claim 4, 
wherein the genomic DNA is designated CECT 4475. 

6. The nucleic acid molecule of claim 1 or 2 
operatively linked to a promoter of RNA transcription. 

7. A vector containing the nucleic acid 
molecule of claim 5. 

8. A host cell containing the vector of claim 

20 7. 

9. The host cell of claim 8, wherein the cell 
is a procaryotic cell or a eucaryotic cell. 
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10. A method of preparing an endoglin-derived 
polypeptide or active fragment thereof, comprising: 

a. inserting a nucleic acid molecule encoding 
an endoglin-derived polypeptide or active fragment thereof 

5 into a suitable expression vector; 

b. inserting the resulting vector into a 
suitable host cell; 



c . 



inducing the resulting host cell to express 
the endoglin-derived polypeptide or active fragment 
10 thereof; and 

d. purifying the resulting endoglin-derived 
polypeptide so produced. 

11. Soluble endoglin-derived polypeptide. 

12. Endoglin-derived polypeptide or a 
15 biologically active fragment thereof produced by the method 

of claim 10. 

13. A pharmaceutical composition comprising the 
polypeptide of claim 11 or 12 and a pharmaceutical^ 
acceptable carrier. 

14. A pharmaceutical composition comprising a 
pharmaceutical^ acceptable carrier and a purified and 
isolated human endoglin-derived polypeptide or a 
biologically active fragment thereof substantially free of 
other host cell proteins. 
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15. A method of modifying a biological functi n 
mediated by the regulatory activity of TGF-fl which 
comprises contacting TGF-B with an effective amount of an 
endoglin-derived polypeptide or a biologically actxv 

5 fragment thereof, thereby modifying the biological 
function . 

16. The method of claim 15, wherein TGF-fl is 

TGF-fl 1. 

17. The method of claim 15, wherein TGF-fl is 

10 TGF-B3. 

18. The method of claim 15, wherein the 
contacting is effected in vitro . 

19. The method of claim 15, wherein the 
contacting is effected in yive. 

20. The method of claim 15, wherein the 
regulatory activity is stimulation of cell proliferation or 
cell growth inhibition. 

21. The method of claim 15, wherein th 
regulatory activity is promotion of extracellular matrix 

2 0 production . 

22. The method of claim 15, wherein the 
regulatory activity is regulation of immune function. 

23. The method of claim 15, wherein th 
polypeptide is a purified human endoglin-derived 

25 polypeptide comprising a disulf ide-linked homodimer of 
about 80 to about 95 JtDa subunits or a biologically active 
fragment thereof. 
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24. The method of claim 23, wh rein the purified 
polypeptide is purified and isolated human endoglin-derived 
protein substantially free of other human proteins. 

25. The method of claim 15, wherein the 
polypeptide is a soluble endoglin-derived polypeptide. 

26. A method of treating a pathologic condition 
caused by TGF-fl regulated cell growth stimulation which 
comprises administering to a subject an effective amount of 
the pharmaceutical composition of claim 13 or 14 to bind to 
the TGF-A thereby treating the pathologic condition caus d 
by cell growth stimulation. 

27. The method of claim 26, wherein TGF-fl is 

TGF-fl 1. 

28. The method of claim 26, wherein TGF-fl is 

15 TGF-J53 . 

29. The method of claim 26, wherein the subject 
is a human patient. 

30. A method of treating a pathologic condition 
caused by TGF-fl regulated inhibition of cell growth which 

20 comprises administering to a subject an effective amount of 
the pharmaceutical composition of claim 13 or 14 to bind to 
the TGF-fl, thereby treating the pathologic condition caused 
by inhibition of cell growth. 

31. The method of claim 30, wherein TGF-fl is 

25 TGF-fl 1. 

32. The method of claim 30, wherein TGF-fl is 

TGF-A3 . 
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33. The method of claim 30, wherein the 
pathologic condition is ulceration or immune suppression. 

34. The method of claim 30, wherein the subject 
is a human patient. 

5 35. The method of treating a pathologic 

condition caused by TGF-fl regulated promotion of 
extracellular matrix accumulation which comprises 
administering to a subject an effective amount of the 
pharmaceutical composition of claim 12 or 13 to bind TGF-fl 
thereby treating the pathologic condition caused by 
promotion of extracellular matrix. 

36. The method of claim 35, wherein the subject 
is a human patient. 

37. The method of claim 35, wherein the 
15 pathologic condition is inflammation, rheumatoid arthritis, 

inflamed skin, lesions, scar tissue formation, lung 
fibrosis, liver fibrosis, atherosclerosis or 
glomerulonephritis . 

38. A method of inhibiting the activity of 
20 endoglin which comprises contacting endoglin with an 

effective amount of a polypeptide capable of binding to 
endoglin to bind endoglin, thereby inhibiting the activity 
of endoglin. 

39. The method of claim 38, wherein the 
25 polypeptide is a TGF-fl. 

40. The method of claim 38, wherein the 
polypeptide has an amino acid sequence corresponding to a 
fragment of a TGF-fl having the ability to bind endoglin. 
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41. The method of claim 39, wherein the TGF-fl is 
TGF-B1 or TGF-fl 3. 

42. The method of claim 40, wherein the TGF-fl is 
TGF-fl 1 or TGF-fl 3. 

5 43. The method of claim 38, wherein the 

polypeptide is an anti-endoglin antibody. 

44. The method of claim 43, wherein the antibody 
is a monoclonal antibody. 

45. The method of claim 38, wherein the 
10 contacting is effected in SiSS* 

46. The method of claim 38, wherein the 
contacting is effected in vitro . 
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FIGURE 4 
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FIGURE 5 
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CC TCC CCC CAC CAC TZZ TCC TCT CAT TCC CAT CO TTC CAC CCC ACC ACC CCC TCC CCC CCC ATC CTT CDC M 

ACA OCA ACT CCA OCT CAC CCT CCC CTC CTT CTT TCC ACT TCT CCT SAC CCC TCC CCC OOC ACC CCA CAA OOC TCC Ml 

AM ACC CAC CCC CTC CCT CT* CCC OCT TOC TCC CCT CCC CTC CCC CTC CCA CCC CAC CCC OOC CCC OCT CCC CDC Jl* 

CCC CAC CTC TCC CAC TOC ACA CAC CAT AAC CCC CAC COC ACA OOC CCC CAC CTC CAC ACC ATC CAC CDC OOC ACC l«« 

6 am 0w Am Aim *»i Aim Am Am Am Aim Mr o* am am Mr m» Vtw Mr Am Air 6I« Thr V»l ma Cyt 

CTC CCT CTC OCT CTT CCC CTC CTC CTC CCC ACC TCC ACC CTC ACC CCC ACA. ACT CTT OCA CAA ACA CTC CAT TCT >M 



581 civ cyt Thr mmt Lrm civ mw r.i am m Air vri Am Mr am nr im rjy Ala AM Am Air Oiy Air Am 

OCT TOC ACA ACC AAA CCC CTC CTC CTC CCC CCC CTC CTC OOC ATC ACC TTT OCT OCT TTC CTC ATC OCX OCC CTC 

606 am iw Air Air am TW9 T\: :ir Tvr %mw mi Tni rro Gl« Tyr rrr Arr rwm rrr CU 

CTCACTCrreCACTCTCCtACATCTACTCCCACACC rrrr r_*r w r-rr- ^ TTi rr I ^ rr^ ppp 2W| 

^rrirf irrnr nr.r 1-r m nr iir f>T r^r irr. -r»» t-rr- f-r? m r^f ^. r^-r f^r JJ«4 

t— e rrr rr- ^— ~ f«r m ser TCC CCC ACC AAC CCC CAC CCC CTC CTC OCC CTC OCT OCC CCC CCC TCC TCC i>J> 
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• • ?GGGGCCACCACTCCTGC?G7CACTGCCATCCATTGGAGCCCArc^ 
9 1 : CGCGTCCCTGTGTCCACTTCTCCTGACCCCTCGGCCGCCACCCCAGAAGGCTGGAGCAGGGACGCCGTCGCTCCGGCCSCC7GCTCCCCT 



* 81 : -CGGGTCCCCGTGCGAGCCCACGCCGGCCCCGGTGCCCGCCCGCAGCCCTGCCACTGGACACAGGATAAGGCCCAGCGCACACGCCCCCAC 

27* * C7GGACAGCATGGACCGCGGCACGCTCCCTCTGGCTCTTGCOT 

MetAspAxgGlyThrl^iiProUsuAlaValAlal-euLeuLeuAlaSeECyaSerLcuSerProIhrSccLeuAUCluThr 

361 : STCCATTGTGACCTTCAGCCTCTGGGCCCCGAGAGGGGCGAGGTGACATATACCA 
28: ValHi«CysAspl^uGlnProValGlyProGluArgGlyGluValThrTyrThrThrSerGlnVAlSerLysClyCy»VAlAAaClnAi^ 

451 : CCCAATGCCATCCTTGAAGTCCATGTCTTCTTCCra 
S8: ProAsnAlaIlel*uGluValHisValI*uPheUmGluPheProTh^ 

541- AATGGCACCTGGCCCCGAGAGGTGCTTCTGGTCCTCAG1GTAAACAGCAGTGTCTTCCTGCATC7CCAGGCCCTGGGAATCCCACTCCAC 
88: AanGlyThrTrpProJirgGiuValLeuI^uVaXUuSerVaXAsnSerSerVal 

631- W ^GCCTACAATTCCAGCCTGCTCACCITCCAAGAGCCCCCGGGGGTCAACACCACAGAGCTGCCATCC7TCCCCAAGACCCAGATCC^ 
118: LcuAlaTyrAsnSerSerLeuVal?hrPheGlnGiuProProGlyValAsnThrIhrCluLeuProSerPheProLy»ThrGlnIleLeu 

721 • GAGTCGGCAGCTGAGAGGGGCCCCATCACCTCTGCTGCTGAGCTGAATGACCCCCAGAGCATCCTCCTCCCACTCGGCCAAGCCCAGGGG 
148: GluTrpAlaAlaGluArgGlyProIleThrSerAlaAlaGlul^u 

178: SerSuH^eCyS^ 

901: CACTTGGAAGGCGTGGCCGGCCACAAGGAGGCGCACATCCTGAGGGTCCTGCCGGGCCACTCGCCCGCG«CCCCACC 

208: HiaLeuGluGlyValAlaClyHisLysGluAlaHisIleUiiArgVall^uProGlyHisSerAlaGlyProArglhrValThrValLys 

991 : GTGGAACTGAGCTGCGCACCCGGCGATCTCGATCCCGTCCTCATCCT^^ 

238: ValGlul^uSerCysAlaProGlyAspl^uAspAiaVaiLeuIleLeuGlnGlyProProtyrValSerTrpl*uIl«AspAlaAanHi3 

•081: AAC ATGC AG ATCTGGACCACTGGAGAAI ACTCCTTC AAGATCTTTCCAGAGAAAAACATTCGTGGCTTCAA^ 
"268: AsnltetGlnIleTrpThrTbrtlyGluTyrSerPbeLy«n^ 

y 171 " GGCC7CCTCGCGCAGGCCCGGATGCTCAATGCCAGCATTGTGGCAICCT 

" 2 98: Giyl^uLeuGXyGluAlaArgrtetl^uAsnAlaSerlleValAlaSerPheValGlu^^ 

1261 : TCCAGCTGCGGTGGTAGGCTGCAGACCTCACCCCCACCGATCCAGACCACTCCTCCCAAGGACW 
328: SerSerCysGlyGlyArgLeuGlnThrSerProAlaProIleGinlhrThrProProLysAspThrCysSerProGluLcuLeuMetSar 

1351: TTGATCCAGACAAAGTGTGCCGACGACCCCATGACCCIGCTACT AAACAAAGAGCITCTTGCGCATTTGA^TC^ACCATCACGCGOTC 
358: LeuIleGlnThrLysCysAlaAjipAspAlaMetThrl^uVall^uLyaLysGlul^uVamaHisLeuLyaCysThrlleThrGlyLeu 

1441- ACCTTCTGGGACCCCAGCTGTGAGGCAGAGGACACGGGTGAC AAGTTTGTCTTGCGCACTGCTrACTCCACCTCTGGCATCCAGCTCTCA 
3B8l ThrPheTrpAapProSerCv^luAlaGXuAspArgGXyAapLyaPheVaX^ 

1531: GC AAGTATGATCAGCAATG^CGGTCGTCAATATCCTCT^ 
418s AlaSerMetlXeserAsnGluAXaValVaXAsnIleLeuSerSerScrSerProGlnArgLysLy5ValHisCysLe\»AsnMetA5pSer 

1621: CTCTCTTTCCAGCTGGGCCTCTACCTCAGCCCACACrrCCTCCAGGCCTCCAACACCATCGAGCCGGGGCAGCAGAGCTTTCTGC 
4 46: LeuSerPheGlnLeuGXyLeuTyrLeuSerProHisPheLeuGXnAXaSerAsnThrlXeGXuProGXyGlnGXnS«rPb«VaXGXnVaX 

'711: AGAGTGTCCCCATCCGTCTCCCAGTTCCTGCTCCAGTTAGACACCTGCC 

*478: ArgValSerProSerVaXSerGXuPheUuI*uGXnLeuAspScrCyaHisLeuAspLeuCXyProGXuGXyGXyThrVaXGlul«uIXc 

1801 : CAGGGCCGGGCGGCCAAGGGCAACTGTGTGAGCCTGCTGTCCCtAAGCCCCG 
508: GlnGlyArgAlaiUaLysGXyAsnCysVaXSerLcuLeuScrProSerProGXuGXyAapProArgPheScrPheieuLeuHisPhcTyr 

1891 : AC AGTAC CC AT ACCC AAAACCGCC ACCCTC AGCTCCACX5GT AGCCCTGCGTCCC AAGACCGGCTCTC AAGACCAG 
£38: ThrVaXProlXeProLysThrGXyThrl.eu£crCysTbrVaXMal*uArgProLysThrGXySerGXnAspGXnGXuVaXHiaArgTBr 

1981 : CTCTTCATGCGCTTGAACATCATCAGCCCTGACCTGTCTGGTTGC^ 
568: ValPheMetArgl^iiAsnllcIXeScrProAspLeuSerGlyCysThrSerLysGlyLeuVaXLeuProAXaVaXLeuGXylXelbrPbe 

2071: GGTGCCTTCCTCATCGGGGCCCTCCTCACTGCTGCACTCTG 
598 x GlyAXaPheLeuIXeGlyAXaLcuLeuThrAXaAXaLeuTrpTyrlXeTyrSerHisThrArgSerProSerLyaArgGXuProVaXVaX 

2161: GCGGTGGCTGCCCCCGCCTCCTCGGAGAGCAGCAGCACCAACCACAGCATCCGCAGCACCCAGAGCACCCCCTGCTC 
628: AlaValAXaAXaProAlaSerSerGluS€rSerSerThrAsnHisSerlXeGXySerThrGXnSerThrProCysSerThrS©rS«tftet 

225X : GCATAGCCCCGGCCCCCCCCGCTCGCCCAGCACCAGAGACTGAG 

658 • A las cop ••■••••••••••••••••••■•••••••••••••••••••••••••••••••••••••••*••••••••"*"*"****""""" 

2341*: TCCTCC ACTCGACCCACAATGG AGCCTGCTCTCCCCCCC^ 

2431: TGGAACACCTTGGGGTCCCTCCACCCCAJCAGAACCTTCAACCCAGTGGGTCTGGGATATGGCTGCCCAGGAGACAGACCACTTG 

2521: TGnGTAAAAACCCAAGTCCCTCTCATTTCAACCTGGATC^ 

2611: AGCCC ACCCC AGCC ACGCC AAC AGC ACCTCCCCCC1GGG AAG AG AAG AGGGCCC AGCCC AG AC^CACCTGGATCT ATCCCTCCGGttTCC 

2701: ACACCTGAACTTGCCTAACTAACTGGCAGGGGACACAGGAGCCTAGCGGAGCCCACCCTG 

2791 : GTTC^AGCCTAGCTCCTCCCACATTCAGCCCCCTCTGCCGGTCGGGCAGCCAGCAGAGGGGCACTATO 

2881: GCCCCTCTGTATTC ACCACC AAT AAATC AG ACCATGAAACC AGTGAAAAAAAAAAAAA 
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